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Abstract 

We consider the logic mso+U, which is monadic second-order logic 
extended with the unbounding quantifier. The unbounding quantifier is 
used to say that a property of finite sets holds for sets of arbitrarily large 
size. We prove that the logic is undecidable on infinite words, i.e. the 
MSO+U theory of (N, <) is undecidable. This settles an open problem 
about the logic, and improves a previous undecidability result, which used 
infinite trees and additional axioms from set theory. 


1 Introduction 

A celebrated result of Biichi is that the monadic second-order MSO theory is 
decidable for the structure of natural numbers with order 

(N,<). 

In other words, MSO is decidable for infinite words. This paper shows that the 
decidability fails after MSO is extended with the unbounding quantiher. The 
unbounding quantifier, denoted by 

UA. ifiiX), 

says that (p{X) holds for arbitrarily large finite sets X. As usual with quantifiers, 
the formula y(A) might have other free variables except for X. Call MSO+U 
the extension of MSO by this quantifier. The main contribution of the paper is 
the following theorem. 

Theorem 1.1 The MSO+U theory of (N, <) is undecidable. 

Background. The logic MSO+U was introduced in [B5j04l , where it was 
shown that satishability is decidable for formulas on infinite trees where the 
U quantifier is used once and not under the scope of set quantihcation. The de¬ 
cidability result from |Boj04| straightforwardly entails decidability of the finite 
model problem for modal /r-calculus with backward modalities. A signihcantly 
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more powerful fragment of the logic, albeit for infinite words, was shown decid¬ 
able in |BC06j using automata with counters. These automata where further 
developed into the theory of cost functions initiated by Colcombet in |Col09j . 
The decidability result from |BC06j straightforwardly entails decidability of the 
star height problem. 

The difficulty of MSO-I-U comes from the interaction between the unbound¬ 
ing quantifier and quantification over possibly infinite sets. This motivated the 
study of WMSO-I-U, which is the variant of MSO-I-U where set quantification is 
restricted to finite set. On infinite words, satisfiability of WMSO-I-U is decid¬ 
able, and the logic has an automaton model |Bojll| . Similar results hold for 
infinite trees [BT12| . The results from |BT12) have been used to decide prop¬ 
erties of CTL* |CKL13) . Currently, the strongest decidability result in this line 
is about WMSO-I-U on infinite trees extended with quantification over infinite 
paths |Bo j 14| . The latter result entails decidability of problems such as the re- 
alisability problem for prompt ltl [KPV09] . deciding the winner in cost parity 
games |FZ12| . or deciding certain properties of energy games |BCKN1^ . 

While the above results showed that fragments MSO-I-U can be decidable, 
and can be used to prove results not directly related to the logic, it was not 
known if the full logic was decidable. The first evidence that MSO-I-U can be too 
expressive was given in [HEU, where it was shown that MSO-I-U can define lan¬ 
guages of infinite words that are arbitrarily high in the projective hierarchy from 
descriptive set theory. This result was used in |BGMS1^ . where it was shown 
that, modulo a certain assumption from set theory (namely V=l), the MSO-I-U 
theory of the complete binary tree is undecidable. The result from [BGMS14] 
implies that there can be no algorithm which decides MSO-fU on the complete 
binary tree, and which has a correctness proof in the ZFC axioms of set theory. 
This paper strengthens the result from |BGMS1^ in two ways: first, we use no 
additional assumptions from set theory, and second, we prove undecidability for 
words and not trees. 


2 Vector sequences 

Define a number sequence to be an element of N“, and define a vector sequence 
to be an element of (N*)“, i.e. an infinite sequence of vectors of natural numbers 
of possibly different dimensions. We write f, g for vector sequences and /, g for 
number sequences. If / is a number sequence and f is a vector sequence, then 
we write / G f if for every position i, the i-th number in the sequence / appears 
in one of the coordinates of the i-th vector in the vector sequence f. Number 
sequences are called asymptotically equivalent if they are bounded on the same 
sets of positions. A vector sequence f is called an asymptotic mix of a vector 
sequence g if every / G f is asymptotically equivalent to some g G g. A vector 
sequence of dimension d is one where all vectors have dimension d. 

In the proofs below we use the following definition: two vector sequences are 
asymptotically equivalent if they have the same dimension d, and for each co¬ 
ordinate i G {!,...,(!} the corresponding number sequences are asymptotically 
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equivalent. 

Lemma 2.1 Let (i € N. There exists a vector sequence of dimension d which 
is not an asymptotic mix of any vector sequence of dimension d — 1. 

Proof 

The definition of asymptotic mix does not use the order structure of natural 
numbers, and therefore in the proof of this lemma we allow sequences to be 
indexed by other countable sets, namely vectors of natural numbers. By induc¬ 
tion on d, we will prove the following claim about vector sequences indexed by 
N'^. We claim that the d-dimensional identity 

id : ^ N^, 

is not an assymptotic mix of any vector sequence 

g . ^ ^d-i^ 

The induction base of d = 1 is vacuous. Let us prove the claim for dimension d 
assuming that it has been proved for smaller dimensions. 

Toward a contradiction, suppose that the d-dimensional identity is an asymp¬ 
totic mix of some g : —)■ Consider the subset of arguments {0} x 

The first coordinate of the d-dimensional identity is bounded on this subset, 
namely it is zero, and therefore there must be some g € g which is bounded on 
this set. Without loss of generality, we assume that the first coordinate of g is 
bounded on arguments from {0} x Let 

g' : pjd ^ f^d-2 

be the vector sequence obtained from g by removing the first coordinate. Let 
TTi : —>■ N with z € {2,..., d} 

be the projection onto the z-th coordinate, which satisfies tt^ G id. Therefore, 
each TTi must be asymptotically equivalent to some gi G g. Let W C be the 
set of arguments x where gi agrees with the first coordinate of g. In other words, 
when restricted to arguments outside Xi the projection is asymptotically 
equivalent to some gi G g'. Since the first coordinate of g is bounded on the set 
{0} X it follows that there is some cz € N such that Xi does not contain 

any arguments which have zero on the hrst coordinate zero and at least Ci on 
the z-th coordinate. Taking c to be the maximum of all C 2 ,..., c^, we see that 
none of the sets X 2 ,. ■ ■ ,Xd intersect the set 

X = {(0,712, ■■■,nd) :n2,...,nd>c}. 

It is easy to observe that the vector sequence 

(0,772,... ,nd) e X (7Z2, ... ,rzd) (I) 
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is an asymptotic mix of g', which is a vector sequence of dimension d — 2. 
This contradicts the induction assumption, because the vector sequence in Q 
is asymptotically equivalent to the {d — l)-dimensional identity. □ 

A vector sequence is said to have bounded dimension if there is some d such 
that all vectors in the sequence have dimension at most d. A vector sequence is 
said to tend to infinity if for every n, all but finitely many vectors in the sequence 
have all coordinates at least n. We order vector sequences coordintewise in the 
following way: we write f < g if for every i, the i-th vectors in both sequences 
have the same dimension, and the i-th vector of f is coordinstewise smaller or 
equal to the i-th vector of g. A corollary of the above lemma is the following 
lemma, which characterises dimensions in terms only of boundedness properties. 

Lemma 2.2 Let fi, f 2 be vector sequences of bounded dimensions which tend to 
infinity. Then following conditions are equivalent 

1. on infinitely many positions fi has a vector of higher dimension than £ 2 ; 

2. there exists some gi < fi which is not an asymptotic mix of any g 2 < f 2 - 

Proof 

Vector sequences that tend to inhnity are maximal with respect to asymptotical 
equivalence in the following sense: if a vector sequence f of fixed dimension 
d tends to infinity, then for every vector sequence h of same dimension there 
exists an asymptotically equivalent vector sequence g < f (to obtain such g, on 
each coordinate of each position we can take the minimum of the two numbers 
appearing in this place in f and h). A corollary of this observation is that 
if {2 is a vector sequence of bounded dimension which tends to infinity, then 
every vector sequence at each (or at each except finitely many) position having 
dimension smaller or equal to the dimension of £2 is an asymptotic mix of some 
g 2 < f 2 . This corollary gives the right-to-left implication in the lemma. 

For the left-to-right implication, we use Lemma |2.1[ Let di be such that 
on an infinite set X C N of positions fi has dimension di and /2 has a smaller 
dimension. By Lemma |2.1[ there is a vector sequence 

h : X ^ 

of dimension di which is not an asymptotic mix of any vector sequence of smaller 
dimension. As we have observed, h is asymptotically equivalent to some gi < 
fi (when restricted to positions from X), because fi tends to infinity on all 
coordinates. Therefore, gi is not an asymptotic mix of any g 2 < f 2 on X, since 
such a vector sequence g 2 has strictly smaller dimension. We can arbitrarily 
extend gi to all positions outside of X, and still it will not be an asymptotic 
mix of any g 2 < f 2 . D 
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3 Encoding a Minsky machine 

We now use the results on vector sequences from the previous section to prove 
undecidability of MSO+U. To do this, it will be convenient to view an infinite 
word as a sequence of finite trees of bounded depth, in the following sense. 
Consider a word 


we {1,2,3,...,nr 

which has infinitely many Ts. We view such a word as an infinite sequence of 
trees of depth n, denoted by tree(r(;), as described in Figure 




Figure 1: An example of tree(w) for n = 3. Formally speaking, the the leaves 
of tree(r(;) are positions with label n, while the tree structure is defined by the 
following rule. Two leaves which correspond to positions x and y with label n 
have a common ancestor at depth i if and only if there is no position between x 
and y which has label in {1 ,... ,i\. In particular, if between x and y there is a 
position with label 1, then x and y are in different trees of the sequence. Note 
that the mapping w i—tree(?ii) is not one-to-one, e.g. in the picture, the first 2 
just after the first 1 could be removed from w without affecting tree(ri;). 

The key to the undecidability proof is the following lemma, which says that, 
in a certain asymptotic sense, degrees can be compared for equality. Here the 
degree of a tree node is defined to be the number of its children. 

Lemma 3.1 There is an MSO+U formula, which defines the set of words 

we{l,2,3r 

which have infinitely many 1 ’s and such that tree(t(;) has the following properties: 

(a) the degree of depth 2 nodes tends to infinity; 

(b) all but finitely many nodes of depth 1 have the same degree. 

Proof 

Condition 0 is easily seen to be expressible in MSO+U. One says that for every 
set of depth 2 nodes, their degrees are unbounded. 
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Let us focus on condition 0. Fix a word w with infinitely many I’s as in 
the statement of the lemma. For an infinite set X of depth 1 nodes, define 

fx : N ^ N* 

to be the vector sequence, where the *-th vector is the sequence of degrees of 
the children of the i-th node from X. Condition 1 says that if X is the set of 
all depth 1 nodes, then ix tends to infinity, which implies that fx also tends to 
infinity for any other infinite set X of depth 1 nodes. 

Call two sets X, Y of depth 1 nodes alternating if every two nodes in X are 
separated by a node in F, and vice versa. Condition 0 is equivalent to saying 
that 

• depth 1 nodes have bounded degree; 

• one cannot find infinite alternating sets X, Y of depth 1 nodes, such that 
infinitely often fx has strictly bigger dimension than fy. 

The first condition is clearly expressible in MSO+U, while the second is express¬ 
ible in MSO-I-U thanks to Lemma □ 


Minsky machines. To prove undecidability, we reduce emptiness for Minsky 
machines to deciding MSO-I-U. By a Minsky machine we mean a (possibly non- 
deterministic) device which has a finite state space, and two counters that can 
be incremented, decremented, and tested for zero. It is undecidable if a given 
Minsky machine has an accepting run, i.e. one which begins in a designated 
initial state with zero on both counters, and ends in a designated final state. 

Let p be a finite run of a Minsky machine of length d. We say that a vector 
of natural numbers (ui,..., 712 ^) describes the run p if, for i = l,...,d, the 
numbers n 2 i-i,n 2 i store the value of the two counters in the i-th configuration 
of p. Note that this description does not specify fully the run p, as the state 
information is missing. The following lemma contains the reduction of Minsky 
machine emptiness to satisfiability of MSO-I-U. 



Figure 2: A sequence of trees as in Lemma 3.2 Here d = 2, ni = 3, and 77,2 = 2 
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Lemma 3.2 For every Minsky machine, one can compute a formula o/MSO+U 
which defines the set of words 


we{l,2,3,Ar 

which have infinitely many 1 ’s and such that tree(w) has the following properties, 
which are illustrated in Figure^ 

(a) the degree of depth 3 nodes tends to infinity; 

(b) all but finitely many depth 1 nodes have the same degree d; 

(c) for every i G {1, ■ ■ • ,d}, all but finitely many depth 2 nodes which are an 
i-th child have the same degree, call it Ui; 

(d) ni — 1,..., rid — 1 describe some accepting run of the Minsky machine. 


Proof 

Condition 0 is clearly expressible in MSO+U. 

We say that a sequence of trees of depth 3 is well-formed if the degree of 
depth 2 nodes tends to infinity, and that it has almost constant degree if all 
but finitely many depth 1 nodes have the same degree. Lemma |3.1| says that 
MSO+U can express the conjunction of being well-formed and having constant 
degree. We will use this property to define conditions Q, Q and @- 

Define the flattening of tree(w) to be the sequence of depth 3 trees obtained 
from tree(w) by removing all depth 3 nodes and connecting all depth 4 nodes 
directly to their depth 2 grandparents. By condition 0, the flattening is well- 
formed. Since the flattening does not change the degree of depth 1 nodes, 
condition 0 is the same as saying that the flattening has almost constant 


degree, and therefore can be expressed in MSO+U thanks to Lemma 3.1 


Define a depth 2 selector with offset f to be a set of nodes X in the tree 
tree(r(;) which selects exactly one child for every depth 1 node (and therefore 
X contains only depth 2 nodes), and all but finitely many nodes in X are an 
+th child. A depth 2 selector, without i being mentioned, is a depth 2 selector 
for some i. Being a depth 2 selector is equivalent to saying that one gets a 
well-formed sequence of almost constant degree if one keeps only nodes from 
and their descendants, where is the set of nodes of depth 2 that have a 
sibling from X to the right. Therefore, being a depth 2 selector is definable in 
MSO+U. Condition Q is the same as saying that for every depth 2 selector X, 
if one only keeps the nodes from X and their descendants, then the resulting 
sequence has almost constant degree, which can be expressed in MSO+U thanks 
to Lemma [3. II 

We are left with condition 0 about Minsky machines. We say that a depth 
2 selector X represents zero, if all but finitely many nodes in X have degree 
one (recall that condition 0 uses Ui — 1 to represent a counter value, because 
a depth 2 node cannot have degree zero). Representing zero is definable in 
first-order logic. If X, Y are selectors, we say that Y increments X if there is 
some n such that all but finitely many nodes in X have degree n, and all but 
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finitely many nodes in Y have degree n + 1. This is equivalent to saying that 
if one keeps only nodes from X UY and their descendants, and then removes 
one subtree of every node from Y, then the resulting sequence of depth 3 trees 
has almost constant degree. Therefore incrementation is definable in MSO+U. 
Using formulas for representing zero and incrementation, it is easy to formalise 
condition 0 in MSO+U (the formula first guesses the missing state information 
to fully specify the run p, and then verifies its consistency with the Minsky 
machine). □ 

In particular, the formula computed in Lemma |3.2| is satisfiable if and only if 
the Minsky machine has an accepting run. This yields undecidability of MSO+U 
on infinite words, which is the same as our main Theorem o A corollary of 
the main theorem is undecidability of the logic MSO+inf, which is a logic on 
profinite words defined in [Tor 12) . because decidability of MSO+U reduces to 
decidability of MSO+inf. 
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